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The number of solutions of <fi (x) = m 

By Kevin Ford 
Abstract 

An old conjecture of Sierpiriski asserts that for every integer k > 2, there 
is a number m for which the equation 4>(x) = m has exactly k solutions. Here 
(j) is Euler's totient function. In 1961, Schinzel deduced this conjecture from his 
Hypothesis H. The purpose of this paper is to present an unconditional proof 
of Sierpihski's conjecture. The proof uses many results from sieve theory, in 
particular the famous theorem of Chen. 

1. Introduction 

Of fundamental importance in the theory of numbers is Euler's totient 
function 4>{n). Two famous unsolved problems concern the possible values 
of the function A(m), the number of solutions of (ft(x) = m, also called the 
multiplicity of m. Carmichael's Conjecture ([1],[2]) states that for every m, 
the equation (f)(x) = m has either no solutions x or at least two solutions. In 
other words, no totient can have multiplicity 1. Although this remains an open 
problem, very large lower bounds for a possible counterexample are relatively 
easy to obtain, the latest being 10 10 ([10, Th. 6]). Recently, the author has 
also shown that either Carmichael's conjecture is true or a positive proportion 
of all values m of Euler's function give A(m) = 1 ([10, Th. 2]). In the 1950's, 
Sierpihski conjectured that all multiplicities k > 2 are possible (see [9] and 
[16]), and in 1961 Schinzel [17] deduced this conjecture from his well-known 
Hypothesis H [18] . 

Schinzel's Hypothesis H. Suppose fi(n), . . . , fk(n) are irreducible, 
integer-valued polynomials (for integral n) with positive leading coefficients. 
Also suppose that for every prime q, q \ f\(n) ■ ■ ■ fk(n) for some n. Then the 
numbers fi(n), . . . , fk(n) are simultaneously prime for infinitely many positive 
integers n. 

Hypothesis H is a generalization of Dickson's prime A;-tuples conjecture [7], 
which covers the special case when each /j is linear. Schinzel's construction of 
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a number with multiplicity k requires that as many as k distinct polynomials 
with degrees as large as 3k simultaneously represent primes. Using a different, 
iterative argument, the author ([10, Th. 9]) deduced Sierpihski's Conjecture 
from the Prime /c-tuples Conjecture, the argument requiring only three linear 
polynomials to be simultaneously prime. The proof is quite simple, and we 
reproduce it here. 

Lemma 1.1. Suppose A(m) = k and p is a prime satisfying 

(i) p > 2m + 1, 

(ii) 2p + 1 and 2mp + 1 are prime, 

(iii) dp + 1 is composite for all d \ 2m except d = 2 and d = 2m. 
Then A(2mp) = k + 2. 

Proof. Suppose 4>(xi) = • • • = 4>{xk) = m and <p(x) = 2mp. Condition (i) 
implies p \ x, hence p\ (q — 1) for some prime q dividing x. Since (q — 1) | 2mp, 
we have q = dp + 1 for some d \ 2m. We have q > 2p, so q 2 \ x and thus 
4>(x) = (q — l)(p(x/q). By conditions (ii) and (iii), either q = 2p + 1 or 
q = 2mp+l. In the former case, (p(x/q) = m, which has solutions x = (2p+l)xi 
(1 < i < k). In the latter case, <j){x/q) = 1, which has solutions x = q and 
x = 2q. □ 

Now suppose A(m) = k and let d\,... ,dj be the divisors of 2m with 
3 ^ di < 2m. Let pi, ■ ■ ■ ,Pj be distinct primes satisfying pi > di for each i. 
Using the Chinese Remainder Theorem, let a mod b denote the intersection 
of the residue classes — d" 1 mod pi (1 < i < j). Then for every h and i, 
(a + bh)di + 1 is divisible by pi, hence composite for large enough h. The 
Prime /c-tuples Conjecture implies that there are infinitely many numbers h so 
that p = a + hb, and 2p+l and 2mp + 1 are simultaneously prime. By Lemma 
1.1, A(2mp) = k + 2 for each such p. Starting with A(l) = 2 and A(2) = 3, 
Sierpihski's Conjecture follows by induction on k. 

Although Hypothesis H has not been proved in even the simplest case of 
two linear polynomials (generalized twin primes), sieve methods have shown 
the conclusion to hold if the numbers /i(n), . . . , fk{ n ) are allowed to be primes 
or "almost primes" (nonprimes with few prime factors). See [13] for specific 
statements. Recently, the author and S. Konyagin [11] employed the theory of 
almost primes to prove that A(m) = k is solvable for every even k. Although 
falling short of a complete proof of Sierpihski's Conjecture, the method did 
provide a solution to the corresponding problem for the sum of divisors function 
a(n). It is proved ([11], Theorem 1) that for every k ^ 0, there is a number 
m for which a(x) = m has exactly k solutions, settling another conjecture 
of Sierpihski. Curiously, the barrier to solving A(m) = k for odd k by this 
method is the probable nonexistence of a number with A{m) = 1. 
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Combining a different application of almost primes with the iterative ap- 
proach from [10], we now provide an unconditional proof of Sierpihski's Con- 
jecture. 

Theorem 1. For each integer k > 2, there is an integer m with A{m) = k. 

Our proof of Theorem 1 is by induction on k. We start with -4(10) = 2, 
A{2) = 3 and for the induction step, we show: 

Theorem 2. Suppose A(m) = k and m is even. Then there is a number 
m! such that A(mm') = k + 2. 

Combining Theorem 1 with Theorem 2 of [10] gives a strong result on the 
distribution of totients (values taken by <fi(n)) with a given multiplicity. 

Corollary 3. For each k ^ 2, a positive proportion of all totients have 
multiplicity k. Specifically, ifV(x) is the number of totients < x, and V^{x) is 
the number of totients < x with multiplicity k, then for each k > 2, 

liming >0. 

x^oo V [X) 



At the center of the proof of Theorem 2 is a famous theorem of Chen ( [3] , 

[4])- 

Lemma 1.2 (Chen's theorem). For each even natural number m and 
x > xo(m), there are S> x/log 2 x primes s G (x/2,x], s = 1 (mod m) such 
that (s — l)/m has at most two prime factors, each of which exceeds x 1 / 10 . 

Remark. A proof of Chen's Theorem also appears in Chapter 11 of [13]. 
What Chen actually proved is that there are 3> xj log 2 x primes p < x with 
p + 2 having at most two prime factors. Trivial modifications to the proof give 
Lemma 1.2. 

Hypothesis S(m). For some constants c > and xo, if x > xo then the 
number of primes s G (x/2, x] for which (s—l)/m is also prime is ^ cx/ log 2 x. 

We suppose cf)(a) = m has the k solutions a±, . . . , a^. The proof is then 
broken into two parts: (I) assuming Hypothesis S(m) is true; (II) assuming 
Hypothesis S(m) is false. For Part I we show that for many such primes s, 
A(mq(q — 1)) = k + 2, where q = (s — l)/m and <j){n) = mq(q — 1) has the 
k + 2 solutions n = q 2 ai (i = 1, . . . , k), n = q(mq + 1) and n = 2q(mq + 1) 
(the "trivial" solutions). 

For Part II, Chen's Theorem tells us that for an unbounded set of x, 
there are 3> x/log 2 x primes s G (x/2,x] with s = 1 + mpq, where p and 
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q are primes and x 1 / 10 ^ p < q ^ a; 9 / 10 . We show that for many such s, 
A(mpq(p — l)(q — 1)) = k + 2, the solutions of 

(1.1) <^>(n) = mpq(p — l)(q — 1) 

being only the "trivial" ones n = p 2 q 2 a>i (i = 1, . . . , k), n = pq{mpq + 1) and 
n = 2pq{mpq + 1). Proving this is more difficult than Part I, but a simple 
heuristic indicates that such s are likely. The basis for the heuristic (and the 
actual proof) is an extension of the classical result of Hardy and Ramanujan 
[14] that most integers n ^ x have about log log x prime factors. Erdos [8] 
combined their method with Brun's sieve to show that for most primes p ^ x, 
p — l has about log log x prime factors. One can further refine these ideas (e.g. 
Lemma 2.8 of [10] or Lemma 4.3 below) to show that for most primes p < x, 
the prime factors of p — 1 are nicely distributed. This means that p — l has 
about log log b — log log a prime factors in the interval [a, b] for 1 ^ a ^ b ^ x. 
Call such primes "normal" . If p and q are both normal primes and x is large, 
then w = p(p — l)q(q — l)m has about 2 log log x prime factors. Fix t and 
suppose n has t distinct prime factors r±, . . . rt, different from p and q. For 
simplicity suppose n is square-free. Then 

cj)(n) = (n -!)■■■ (r t - 1) = w. 

For a given factorization w = d\ ■ ■ ■ dt, the probability that d\ + 1, . . . , d t + 1 
are all prime is about ((logdi) • • • (logc^)) -1 . The critical case is t = 2. Since 
p and q are normal, w has about 2 log logy prime factors ^ y, and thus has 
about (logy) 21og2 divisors ^ y. By partial summation, 

^ (logdi)(logd 2 ) 

w=d\d2 
di^2 

the right side representing the probability that (1.1) has a nontrivial solution 
for a given pair (p, q). Similarly, it can be shown that 

^ (logdi)---(log^) 

w=d\—dt 
di^2 

the bulk of the sum coming from taking d±, . . . , d t ~2 ah small. 

The biggest practical problem with these arguments is the fact that 4>{n) 
is quadratic in q for Part I and quadratic in both p and q for Part II. Thus, 
when certain sieve estimates are employed, the bounds have factors of the form 

p(D, Z )=n 
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where (— ) is the Legendre symbol. Heuristically, such products should be 
O(l) on average, and showing this for a wide range of the parameters D and z 
requires strong results on the distribution of primes in arithmetic progressions 
(see (2.1) and Lemmas 2.6 - 2.8 and 4.2 below). 

In the next section, various lemmas needed in the proof of both parts are 
presented. Section 3 is devoted to the proof in Part I. Section 4 contains further 
lemmas needed for Part II, and the argument counting nontrivial solutions of 
(1.1) with (n,pq) > 1. The last section deals with the hardest case, namely 
counting solutions of (1.1) with (n,pq) = 1. 

The following notation will be used throughout. Let P + (n) denote the 
largest prime factor of n and P~(n) denote the smallest prime factor of n. 
The functions co(n) and f2(n) count the number of distinct prime factors of 
n, respectively the number of prime factors counted according to multiplicity. 
Let £l(n, U, T) denote the number of prime factors p of n such that U ^ p < T, 
counted according to multiplicity. Let (^) denote the Legendre-Jacobi symbol 
and let r(n) be the number of positive divisors of n. Constants implied by the 
Landau O—, Vinogradov <C — and 3> — symbols may depend on m, but not 
on any other parameter unless indicated. The symbols p, q, and w will always 
denote primes. The symbol m will always refer to the number in Theorem 2. 
The cardinality of a set S will be denoted as #5 or \S\. 



2. Preliminary lemmas 

Aside from Chen's Theorem, the other sieve results we require all follow 
from a basic sieve inequality called the "Fundamental Lemma" of the combi- 
natorial sieve (Theorem 2.5 of [13]). 

Lemma 2.1. Suppose F is a polynomial of degree h ^ 1 with integer 
coefficients and positive leading coefficient. Denote by g(zu) the number of 
solutions of F(n) = (mod w). For 2 < z < x, 

#{n^x: P-(F(n)) > z} = x ~[[ (l - 

x ^ e -\/log^ _|_ e -M(logu-loglog3«-log/i-2) x | ^ 

where u = log xj log z. 

Proof. If q{w) = w for some prime w < z, then the left side is zero. 
Otherwise the lemma follows from [13, Th. 2.6] and following remarks. □ 
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Remark. The error term is 0^(1), and this will suffice for most of our 
applications. Also, for future reference, we note that for w \ 2a, 

(2.1) #{n : an 2 + bn + c = (mod w)} = 1 + 

Lemma 2.2. Suppose 

F(n) = (ain + &i) ■ ■ ■ (a h n + b h ), 
where the ai are positive integers and the bi are integers. Suppose 
E := ai ■ ■ ■ a h [] (ajftj - ajbi) / 0. 

If 2 < z ^ x and log E < log 3 x, then 

#{n< X :P-(F(n))>z}<< h ^ h ^y 

<.h x{\og\ogx) h (\ogz)~ h . 

Proof. In the notation of Lemma 2.1, we have q(w) = h when w \ E; 
therefore the number of n in question is 

vj\E 

The last part of the lemma follows from the inequality E/<f>(E) <C log log E. □ 

Lemma 2.3. Let V(x) be the number of distinct values of (j)(n) which are 
< x. Then for every e > 0, 

V ^ n X M-e - 

(logx) 1 6 

Proof. This is the main theorem of [8]. □ 

LEMMA 2.4. The number of n ^ x divisible by a number s satisfying 
s > exp{(loglogx) 2 } and P+(s) < s w / lo ^ x is <C x/log 10 x. 

Proof. For exp{(loglogx) 2 } < y < x, let G(y) be the number of s < y with 
P+(s) ^ s io/iogioga;^ n (i eno te a number with no prime factor exceeding 
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W : !j Ul ; "*' and set a = 1 - 20 fe^ . Then 



oo 



n=l 



(logx) 



20 



(iogx; 



20 



exp 



E^~ 



By the prime number theorem, 



X>-«,a 



2,10 e^x 

(logs/)- 1 M 



sc (l + o(l))(e 200 + log logy) s$ 2 log logx 

for large x. Thus <C y/(logx) 18 and the lemma follows by partial sum- 
mation. For a survey of results on the distribution of numbers without large 
prime factors, see [15]. □ 

The next four lemmas are needed to bound the product P(D, z) mentioned 
in the introduction. 

Lemma 2.5. When (a,n) = 1, n is odd and square-free, 

'aj + 6 N 



E 

<J,n)=l 



n 



When (ab, n) = 1 and n is square-free. 

E (i u " 3 + " 



n / \ n 



1. 



Proof. Denote by /(n; a, b) the first sum in the lemma. The desired bound 
follows from the identities f(n;a,b) = — (^) for prime n, and 

/(nin 2 ; a, 6) = f(n 1 ;an 2 , b)f(n 2 ; an ± , b) 

when (a, n\n?) = (711,712) = 1. The second assertion follows in a similar 
fashion. Here we adopt the convention that (j) = 1 for all a. □ 

Lemma 2.6. For some absolute constant yo, if y > yo, 1 < c < 2, i£ ^ 1, 
anc? .D € Z, i/ien 

nK (§))'« s ^(f)- 

ro^j/ n ^^loglogj/ 
^ E (n,2£)=l 
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Proof. If D = 0, the left side is 1 and the right side is 4, so suppose D / 0. 
For 1 sc c 2 and x ^ -1/3, (1 + x) c < (1 + cx)(l + 3x 2 ); thus 

nH(§)Hn(i + J0 n H(§ 



vo\E 



4 11 V tu 2 / ^— ' n \ n 

(n,2E)=l 

<3.92 £ _^i(_j +3 . 92 £ 



c ^(n) 



n 

P+(rt)<j/ ' n>J/ loglogj/ 

(n,2B)=l P+(n)sSj/ 

nsSj/ loglogy 

Since < 7"(n), by Lemma 2.4 and partial summation, we obtain 

V — - < V — V — V — 

n>j/ lo s lo s v did 2 >y loeloey d 2 >y (l ° e log v)/2 d i^ d 2 

P+(n)^y P+(did 2 )^y P+(d 2 )^y 



« E 



log d 2 



d 2 

d 2 >y {l °z lo sf)/ 2 

P+(rf 2 ) 5 Srf2 /1 ° SlOSd2 

< (logy)" 8 . 

As the left side in the lemma is » (logy) _c , the desired bound follows for 
sufficiently large y. In particular, the sum in the lemma is positive and 3> 
(logy)~ c . □ 

Lemma 2.7. Suppose y ^ y , U ^ yio(iogiogy) 2 ^ ^ B and R are pos m ve 
integers with R even and log(ABR) ^ log 3 U. Suppose D(s) = A(A — Bs), 
D(s) = As(As - B) or D(s) = A±Bs. Uniformly in A, B, R and 1 sc c s$ 2, 

£ n f 1 - - ( D{s) )Y « c/(ioglogC7)1+c 

s prime 



w \ w J J log U 



^ n ( 1 - 1 ( —) ) c « c/(log log U)A+C 



s,sR+l prime 



w \ w j j (log uy 
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Proof. Let z = [/Viogiogt/. jf £>( s ) = A(A - Bs) or D(s) = As(As - 77), 
set E = AB; otherwise set E = B. By imposing the addition condition 
that w \ E, we see that the products on the left are only increased when 
D(s) = A(A - Bs) or D(s) = As(As - 77), and in the case D(s) = A + Bs the 
product is changed by a factor 

n f 1 - ^ (£)) « aogiogi?) c « (iogio g t/) c . 

We will also replace the conditions that "s prime" and "s,sR+ 1 prime", 
with P~(s) > z, respectively P~(s(sR + 1)) > z. This will eliminate at 
most z values of s from the sums on the left, reducing the sums by at most 
Oology) 2 ) = O e (U £ ). Let ^ denote the set of s < U with P~(s) > z and 
let y 2 denote the s < U with P~(s(sR + 1)) > z. By Lemma 2.6, 



zu \ w J J I —r — ' n \ n 

ro t-E (n,2B)=l 

P+(n)<y 



^W^/ ) 
^— ' n 

n ^ y lo g lo g y 

(n,2B)=l 



£>(«) 



n 



Denote by S^J^; n, 6) the number of elements of the set which lie in the 
residue class b mod n. By the sieve fundamental lemma (Lemma 2.1), and the 
fact that n < z 1 / 5 , uniformly in (b,n) = 1 we have 

^(i + omogin- 10 ^ 

n 

where 

n log log J7 



n, 6) = — (l + 0((log lO' 10 )) , 
= J] (1 - « 

137<i 

For each n, Lemma 2.5 gives 



4>(n) log £/ 

vj\n 



se^i v 7 (6,n)=l V 7 

™* E (^>w (1 „ gC ,)- 

6,n)=l ^ ™ 7 



n 

(6,n) 

E/ log log E/ 1 17 



log [/ 0(n) log 10 17 
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and the first inequality follows. The second inequality follows in the same 
manner by summing over s £ S*2 and using 

S(y 2 ; n, b) = ^(l + 0((l g^)- 10 )), 

where Wi = if (n, 1 + bR) > 1 and otherwise 



*-ne-§) n RM^)^- □ 



vj\nR w\R,zu]n 



Lemma 2.8. For some absolute constant b > 0, the following holds. For 
each Q ^ 3, there is a number oIq ~^>a (log(2) A for every A > 0, so that 
whenever \D\ < Q, oIq \ 8D, and logz ^ exp{(log Q) b }, 



n 



zu \ w 



Qlo S lo g Q <m ^ z 

Furthermore, 16 \ dQ and p 2 \ dQ for all odd primes p. 

Remark. From now on dQ will refer to the number in this lemma. The 
implied constant in dQ ^>a (log Q) A is ineffective for A ^ 2. With more work, 
the set of exceptional D may be reduced to those with f(D) = /(dg), where 
f(n) is the product of the odd primes which divide n to an odd power. The 
upper limit of z may also be extended easily. 

Proof. We may assume without loss of generality that D is square-free. 
If \D\ < 2 the result follows from the prime number theorem for arithmetic 
progressions, since (— ) then depends only on w modulo 8. Next suppose 
\D\ ^ 3. The desired bound follows from a version of the prime number 
theorem for arithmetic progressions deducible from the Landau-Page theorem 
and a zero density result of Gallagher (see [6] and [12]). The number dQ 
is the order of a real, primitive Dirichlet character whose corresponding L- 
function has a real zero (3 > 1 — co/logQ. If Co > is a sufficiently small 
absolute constant, the Landau-Page theorem asserts that there is at most one 
such primitive character of order < 8Q. Furthermore, if this character exists, 
Siegel's Theorem implies that dQ »a (logQ)" 4 for any A > 0. Let ir(y;q,a) 
denote the number of primes p < y, p = a (mod q). For n < 8Q not divisible 
by d Q , (a,n) = 1 and y > Q := Q lo s lo s<2, Theorem 7 of [12] implies 



7r(y; n, a) = — ^ (l + 0((log Q) 

0{n) logy V 



-6- 
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for some absolute constant b, < b < 1. Therefore, for z > Q. 



o- 



EJ_ = log log z - log log Qp ( log log z 
w cf>{n) \<j>(n)QogQ) 1 

Qo<ra<z 
vj=a (mod n) 

Write D = D1D2, where D2 is the largest odd positive divisor of D (i.e. 
D\ G {—2,-1,1,2}). Since L>2 ^ 3, quadratic reciprocity implies 

where i w G { — 1,1} depends only on tu modulo 8. Therefore, for numbers 
ie G {-1,1}, 

E i(^)= E ^ E (^) E ^ 

Q ( ,<ro^z e=l,3,5,7 \^a^SD 2 Q <m^z 

a=e (mod 8) zu=a (mod 8D2) 

_ log log z - log log Qp \ - . ^/ 8a' + e \ / loglogz 

Here we have used the fact that (7^) = if (a, D2) > 1. The error term is 0(1) 
by hypothesis, and the sum on a' is zero, since as a' runs from 1 to D 2 , 8a' 
runs over a complete system of residues modulo D2. The lemma now follows 
from the fact that 1 + h = exp{/i + 0(h 2 )} whenever — \ ^ h ^ \. □ 



3. Proof of Theorem 2, Part I 

Lemma 3.1. If A(m) = k and S(m) is true, then A(mm') = k + 2 for 
some m'. 

Proof. Suppose 4>(a\) = ■ ■ ■ = 4>(ak) = m. Suppose x is sufficiently large, 
so that x > 2«j for every i and there are 3> xj log 2 x primes q G (x/2, x] such 
that qro + 1 is prime. For each such q, the equation 4>(n) = q{q — l)m has at 
least k + 2 trivial solutions, namely n = g 2 aj (i = 1, . . . , k), n = q(qm + 1) 
and trivial n = 2q(qm + 1). Let B(x) denote the number of such q giving rise 
to additional solutions. If <fi(n) = q(q — l)m, then clearly q 3 \ n. If q 2 \ n, then 
4>(n/q 2 ) = m, which implies n is one of the trivial solutions. If q | n, q 2 \ n, 
then cf)(n/q) = qm. There is a prime r | (n/q) with q | (r — 1), i.e. r = + 1 
for some d \ m. If d = m then n is one of the trivial solutions mpq + 1 or 
2(mpq + 1). By Lemma 2.2, the number of q with dq + 1 for some other d is 
0(x(log x)~ 3 ). The last case, counting solutions with (n,q) = 1, is codified in 
a lemma, since we will need the argument in Part II. It follows from the next 
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lemma that the number of q with 4>{n) = q(q — l)m for some n with (n, q) = 1 
is 0(x(logx) -2 ' 1 ). Therefore, B(x) = 0{xi}ogx)~ 2A ) and hence for some q, 
A{q(q - l)m) = k + 2. □ 

Lemma 3.2. Suppose 6 = 1 or b is a prime less than y 10 . Let N(y) be 
the number of primes q £ (y/2,y] satisfying (i) qbm + 1 is prime, and (ii) for 
some n with (n,q) = 1, 4>{n) = mq(q — 1). Then 

N(y) <^ y{\ogy)- 2A . 



Proof. First, we remove from consideration those q for which q — 1 has a 
divisor s with s ^ exp{ (log logy) 2 } and P + {s) ^ s i/loglog2/_ gy L emma 2.4, 
the number of such q is 0(y/ log 10 y). Consider the equation 4>(n) = mq(q — 1). 
There is a number r | n with q \ r — 1. Say r = 1 + gso and let wo = 4>{n/r). 
For some factorization of to as to = toito-2, we have toi | so, TO2 | wq. Fix toi 
and TO2, and let s = so/mi, w = wo/™2, so that sw = q — 1. The number of g 
in question is at most the number of pairs (s,w) for which y/2 — 1 s$ sw ^ y, 
wm,2 is a totient (value taken by (p(n)), and the following three numbers are 
prime: 

(3.1) ws + 1 = q, (ws + 1)to6 + 1 = qmb + 1, (ws + 1)stoi + 1 = r. 

For brevity write E(c) = exp{(log y) c }. For fixed s, Lemma 2.2 implies that 
the number of w < y/s with each number in (3.1) prime is 

2/ (log logy) 3 
s log 3 (y/s) ' 

Therefore, the number of pairs (s,w) with s ^ E(0.89) is 0(y(log y)~ 2,1 ). 
Otherwise w y/E(0.89). By Lemma 2.1 and (2.1), for fixed w < y/£(0.89), 
the number of s with each number in (3.1) prime is 

y (log logy) 3 tt / 1 /to 2 — 4toi^ n 



^(logy)^ ' ^ 2 \ ro V ro 



where z = £(0.89) x / 2 . 

First, suppose that w 6 (l^"/2, £7] where is a power of 2 less than 2E(l/2). 
Let Q = 8mE(l/2) and recall the definition of c?q (Lemma 2.8). The number 
of w with oIq I (mf — Amiw ) is clearly 

4to!?7 , C/ C/ 

< — : + 1 < T, TTTTT < 



OogQ) 2 logz' 

For the remaining w, Lemma 2.8 implies that 
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Hence, since w is a totient, Lemma 2.3 gives 



1 (m\ — Amiw\\ (logy) 1 / 2 log logy 



w 11 V~ w I I ~~~~ (logf/) - 999 ' 

By (3.2), summing on U gives a total of 0(y(logy) -2 ' 16 ) possible pairs (s,w). 

Lastly, consider £7(1/2) s$ w ^ y/£7(0.89). The product in (3.2) for 
m > £7(0.48) is 0((logy) - 41 ). By our initial assumption about q, w has a 
prime factor > £7(0.49). Thus n/r has a prime divisor r 2 > £7(0.49). We can 
assume that r\ \ n, for the number of q divisible by a factor r 2 (r 2 — 1) for such 
large r 2 if O(y/£7(0.49)). Write w* = <p(n / (rr 2 )) , so that w = (r 2 — l)w*. Fix 
w* and put r 2 G (?7/2, f7], where U is a power of 2 in [£7(0.49), 2y/(sw*)]. By 
Lemma 2.7, 

EJ_ TT (^i 1 / raj-4miw*(r 2 -l) \\ ^ (log logy) 2 
r 2 ^ I w \ ZD J J logf/ ' 

Summing on U and again using (3.2) and Lemma 2.3, we see that the total 
number of pairs (s,w) is 0(y(logy)~ 2,25 ). Thus, for fixed mi,m 2 , the number 
of possible q is 0(y(logy)~ 2,1 ). Since the number of choices for (mi,m 2 ) is 
O(l), the lemma follows. □ 



4. Part II, the case (n,pq) > 1 

We require one more sieve lemma, needed only for a special case of the 
proof for Part II. 

Lemma 4.1. Suppose r, s, t, u are positive integers, and x, y, z are positive 
real numbers satisfying x ^ y ^ z ^ 2 and log(rstu) < log 3 x. Let 

w 9 ,h = (rg + l)(sh+ 1) 

and 

F(g, h) = {tvjg )h + l){ughvjg )h + l)ghw g>h . 

Then 

(log log a;) 6 



#{x < g < 2x, y < h < 2y : P (F(y, fc)) > z} < xy- 



(log z) 



Proof. Let g(ro) denote the number of solutions of F(g, h) = (mod tu). 
If £>(ro) = w 2 for some to < z then the left side is zero. Otherwise by the sieve 
fundamental lemma (Theorem 2.5 of [13]), 

q{zu)\ 



#{x<g^2x,y<h^2y: p-(F{g,h)) > z) « xy ]J M - 



TO 2 
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We next show that for w \ 30rstu, 

(4.1) -12 - 2ro 3/4 g(w) - 6m < -10 + 2ro 3/4 . 

From (4.1), the lemma follows at once, since then 

/ 30rstu \ 6 



n(>-f)«(|^) na-M' 

(log log xf 



(log z) 6 

To prove (4.1), suppose tu ^ z, w \ 30rstu. Let M±, M 2 , M 3 , M 4 denote the set 
of solutions mod w of gh = 0, = 0, + 1 = and ughw 9t h + 1 = 0, 

respectively. It is easily verified that M x n M 4 = M 2 n M 3 = M 2 n M 4 = 0, 
|Mi n M 2 | = |Mi n M 3 | = 2 and |M 3 n M 4 | < 2. Also, |Mi| = |M 2 | = 2ro - 1 
and |M 3 | = w — 1. By inclusion-exclusion, 

0(07) = I Mi U M 2 U M 3 U M 4 | = 5zu - 7 + |M 4 | - |M 3 n M 4 |. 

Since the number of solutions of q(qR + 1) = a (mod w) is 1 + ( 1+4 _ r a - R ), we 
have 

im 4 | = x; 1+ — — 1+ 



a=l 



a=l 



W \ \ "UJ 



/ 1 + 4ar \ / a \ / a — 4su 1 



/ 1 + 4ar \ / a \ / a — 4sm \ ,_, 

3+ E -5- b) H 5— =! — 3+B - 



By Theorem 1 of Davenport [5], we have \E\ < 2ro 3 / 4 and (4.1) follows. □ 

We next combine Lemmas 2.7 and 2.8 into an estimate which has enough 
generality for our applications. 

Lemma 4.2. Suppose 5 > 0, z > 100. Suppose A, B, Q, R, U are positive 
integers satisfying (logz) 6 < logQ < (logz) 3 , \og(ABRU) < log 3 z, U > 10, 
(AB, do) ^ dg 2 andQ^s AU(AU + B). Suppose either D{s) = A{A - sB) or 
D(s) = sA(sA — B). Let Sf\ denote the set of primes s < U, and 5^2 the set of 
primes s < U with sR + 1 prime. If 1 < c ^ 2 and £ {1, 2}, i/ien 



Proof. Partition =5^ into <5^', the set of s G =5^ for which rig f AD(s), and 

the remaining s. If D(s) = A(A — sB), then S p " is contained in a single 

1/2 

residue class modulo g, where g = dq/(dQ,AB) ^ dA . Recall that 16 \ dq 
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and d,Q is not divisible by the square of any odd prime. If D(s) = sA(sA — B), 
then (s,cIq) > 1 for at most uj(cIq) ^ logQ < log 3 z primes s (Here we use 
d,Q < 8Q). For other s, cIq\A(sA — B) for s lying in a single residue class 
modulo g = d'/(d'/,A), where d! = dQ/(dQ,A). Here (d',A) ^ 8, therefore 

1 1/2 

g ^ gC?Q . Thus, in every case 

\y"\ < - + 1 + log 3 z « - ^ . + log 3 z , 

g (logQ) 8 / d 
so the contribution to the sum in the lemma from s £ ,y" is 

0(C/(log z)" 6 + log 5 z) = 0(U/ log 2 U + log 5 z). 
By Lemma 2.8, ii s e Y' and z ^ Q := Q lo s lo gQ then 

n H(W«- 

Set y = [/i/(20(iogio g C7) 2 )_ Trivial i y W e have 

jJ^(W<<(^)M^)<'°- : 

Therefore, for any admissible z, Lemma 2.7 implies 

i (D( s y ' 



3c 



En ' 



c 



W \ W 

n H'™ 

U /logQV 4+4c 

< (i^U^J ^ lo ^) + ■ □ 

As mentioned in the introduction, for most primes p ^ x, the prime factors 
of p — 1 are nicely distributed. We prove a simple result of this kind below. 
For brevity write E(z) = exp{(log x) z }. 

Lemma 4.3. Let N ^ 10 be an integer, put 5 = 1/N and suppose x 1 ^ 10 < 
y ^ x. The number of primes p ^ y not satisfying 

n(p-l,E(jS),E((j + 1)5)) ^ (5 + S 2 )loglogx (0< j^iV-l) 

is 

< <<5 y(logx)- 1 - 53 / 3 . 



w 



Proof. Assume that x is sufficiently large with respect to 5. First, the 
number of p ^ y for which P + (p — 1) < yi/loglogs j s O(y/log 10 x) by Lemma 
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2.4. For the remaining p, let q = P + (p — 1) and p — 1 = qa. For fixed a, the 
number of q < y/a with ga + 1 prime is, by Lemma 2.2, 

y log log x j/(log logx) 3 
a log (y/a) a log x 

For each j, let = [E(j5), E((j + 1)5)] and let Bj be the sum of 1/a over 
a ^ y with at least (5 + 8 2 ) log logx — 1 prime factors in Ij. We have 



n(a,£(j<5),£(j5+5)) 



p \tt/ 1 + S 



p^x V ^ J P&h V 

pglj 

« (logx)^ 52 -^ 52 ) 10 ^^) « (logx) 1 - - 453 , 
and the lemma follows. □ 

Suppose that Hypothesis S(m) (following Lemma 1.2) is false. By Lemma 
1.2, for an unbounded set of x, there are 3> x/log 2 x primes s G (x/2,x] for 
which s = 1 +pqm, where p and q are primes and x 1 / 10 ^ p < x 1 / 2 . Denote 
by srf the set of such pairs of primes (p, q) . We will show that for x sufficiently 
large, there is some such pair (p, q) giving A(mpq(p — l)(q — 1)) = k + 2, with 

(4.2) 4>(n) = mpq(p — l)(q - 1) 

having only the trivial solutions n = p 2 q 2 ai (i = 1, . . . , k), n = pq{mpq + 1) 
and n = 2pq(mpq + 1). Before proceeding, we must cull from srf certain pairs 
(p, q) possessing undesirable properties. Let F(p) be the number of q with 
(p, q) G srf and let G(q) be the number of p such that (p, q) £ si ' . By Lemma 
2.2, we have 

(4.3) F(p)«— V. 



plog x ?log X 

By (4.3) and the Brun-Titchmarsh inequality, if K is a large enough constant 
(depending only on m), 

#{(p,q) £ g/ : p = l(mod 2 K ) or q = l(mod 2 X )} 

Denote by «e/' the set of remaining pairs. Define v by 2 U \ m, 2 U+1 \ m. If n 
has h distinct prime factors, then 2 h ~ 1 \ 4>{n). Consequently, if (4.2) holds and 
(p, q) G si', uj(n) ^ C where C = 2K + 1 + v. Let 

(4.4) N = 100([41og(2C + 2)] + 14 + 2C), 6 = 1/N. 
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For brevity, write Ej = exp{(log x) 1 ^^} for ^ j ^ N — 1 and let En = 2. 
Define 

(4.5) Qj = log log dj=dQjf (i = 2, . . . , N). 
Let SB be the set of (p, q) € srf' satisfying the additional conditions 

(4.6) d 2 | (p — l)(q — 1) implies d < log 4 x, 

(4.7) dp + 1, dq + 1, dpg + 1 are composite for each d \ m, 

except for mpq + 1, 

(4.8) p(p - 1) + 1, q(q - 1) + l jP9 (p - 1) + l,pq(q - 1) + 1, 

are all composite, 

(4.9) (m(p-l)(^-l),^)^4 /2 (j = 2,... ,7V), 

(4.10) pi (9-1), 



and 
(4.11 



j ft(p - 1, E j+1 ,Ej) ^{5 + 5 2 ) loglogx (0 < j < N - 1) 
1,^+1, ^ (<5 + <5 2 )loglogx (0 < j < TV - 1). 
First, the number of (p, q) failing (4.6) does not exceed 



E E E 5> <<xlogx E ^«*aogx) 2 - 9 . 

<2>log 4 ir a\d 2 n<x,a\n rf>log 4 x 

Since m is fixed, by Lemma 2.2 and (4.3), the number of pairs failing (4.7) is 
0(x/log 3 x). For (4.8), given q the number of p with pq(q — 1) + 1 prime is 
0(x(log log x) 3 /(q log 3 x)) by Lemma 2.2. Thus the number of pairs (p, q) with 
pq(q— 1) + 1 prime is 0(x(loga;)~ 2 ' 99 ). Likewise the number of pairs satisfying 
pq(p — 1) + 1 prime is 0(x(logx) -2 - 99 ). Also, by Lemma 2.2 and the prime 
number theorem for arithmetic progressions, for each q the number of p with 
p(p — 1) + 1 prime is 

< qlog 3 x II ( X p ( p )) < gl og 3 x' 

Thus the number of pairs (p, g) with p(p— 1) + 1 prime is 0(x/ log 3 x), and the 
same bound holds for the number of pairs with q(q — 1) + 1 prime. To deal with 
(4.9), for each j let Vj denote the set of numbers a < dj with (a, c/j) > dV 4 /m. 
Then 

|^|= ^ #n/d)< ^ n/d 

d|dj ,d>d,y 4 /m d\dj ,d>d 1 / A /m 

= d< r(dj)mdf 4 « df. 

d\dj,d<md 3 / 4 
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By Lemma 2.8, dj 3> (log Qj) 40 ^ ^ (logx) 40 and for x large, m < for 
every j. Therefore, the number of pairs (p, q) failing (4.9) is at most 




p=l+a (mod dj) q=l+a (mod dj) 



(logx) 3 

The number of pairs failing (4.10) is trivially 0(x 9 / 10 ). By (4.3), Lemma 4.3 
and partial summation, the number of pairs failing (4.11) is at most 

Thus, for x large, 

1 , 9 

(4.12) |^| ^ -|^'| > x/log 2 x. 

We will show that for most pairs (p, q) 6 the equation (4.2) has only the 
trivial solutions. We break this argument into two parts, the second of which 
is the most delicate and is proved in the next section. 

Lemma 4.4. The number of pairs (p, q) 6 S3 for which (4.2) has a 
nontrivial solution n with (n,pq) > 1 is 0(x(log x)" 2 ' 1 ). 

Proof. Consider a generic solution (n,p,q) of (4.2) with (n,pq) > 1. By 
the definition of p 3 \ n and q 3 \ n. If (n,p 2 q 2 ) = p 2 q 2 , then (j)(n/p 2 q 2 ) = 
m, which has the k trivial solutions n = p 2 q 2 cti (i = 1,... ,k). Next, if 
(n,p 2 q 2 ) = p 2 q, then 4>{njp 2 q) = mq. This implies that there is some prime 
r | n with q\(r — l)\mq, which is impossible by (4.7). Similarly, (n,p 2 q 2 ) = pq 2 
is impossible. If (n,p 2 q 2 ) = pq, then 4>(n/pq) = mpq, implying n has a prime 
divisor of the form dp + 1, dq + 1 or dpq + 1 for some d \ m. By (4.7), 
n = pq(mpq + 1) or n = 2pq(mpq + 1). If (n,p 2 q 2 ) = p 2 , then (j)(n/p 2 ) = 
mq(q — 1). By Lemma 3.2 (with b = p), for fixed p the number of possible 
q is 0((x/p)(log x)^ 2 ' 1 ); therefore the total number of possible pairs (p, q) is 
0(x(logx)~ 2A ). The same argument handles the case (n,p 2 q 2 ) = q 2 . 

Finally, suppose (n,p 2 q 2 ) = p (the symmetric case (n,p 2 q 2 ) = q is dealt 
with the same way). Then c/)(n') = mpq(q — 1), where n' = n/p, and there are 
two cases: (i) there is a prime r | n' with pq \ (r — 1); (ii) there are primes 
ri,r2 dividing n' with p\ (r± — 1) and q \ (r2 — 1). 

In case (i) write r = 1 + pqsQ and u>o = 4>(n' /r), so So^o = m (l — !)• Fix 
m± and m,2, where m = mim,2, m\ \ so and ni2 \ wo. Write w = wo/ni2 and 
s = so/mi, so that sw = q — 1. For fixed s ^ £'(3/4) and fixed p, the number 
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of w with sw + 1 = q, psmi(sw + 1) + 1 = r and pm(sw + 1) + 1 all prime is 

x (log logx) 3 
sp log 3 x 

Summing on s ^ £(3/4) and p yields a total of 0(x(log x)~ 2 - 24 ) possible pairs 
(p,q). The other possibility is that w < q/£(3/4). By Lemma 2.1 (with 
z = £(0.74)) and (2.1), for fixed w and p, the number of s is 

x (log log x) 3 



pw (logx) 2 - 22 



TT f i ^ ( m ^P 2 ~ ^ m iP w \\ 



Put p G (J7/2, £7], where {/ is a power of 2 between x 1 / 10 and x 1 / 2 . By Lemma 
2.7, the above product has average 0((loglogx) 2 ) over such p. Summing over 
U, it follows that for fixed w, the number of pairs (p, s) is 0((x/w) (log x) -2 ' 21 ). 
Since wm2 is a totient, Lemma 2.3 implies that the number of pairs (p, q) is 
0(x(logx)~ 2 2 ). Therefore there are 0(x(logx)~ 2 - 2 ) pairs (p, q) counted in case 
(i), since there are 0(1) choices for mi and m.2- 
In case (ii), write 

r 1 = l+psi, r 2 = l+qs 2 , n' = rir 2 t, w = <j)(t). 

Fix mi, m2,m3, where m = 171^2013, m\ \ s\, m 2 \ s 2 and 777,3 \ wq. Let 
S3 = si/mi, S4 = s 2 /m 2 and 77; = 77)0/7773. The number of possible pairs (p, q) 
is at most the number of quadruples (S3, S4, w,p) where p is prime, wins is a 
totient, and the following four numbers are prime: 

(4.13) WS3S4 + 1 = q, 

pszmi + 1 = n, 

(WS3S4 + 1)54171,2 + 1 = r 2 , 

(WS3S4 + l)pm + 1 = pqm + 1. 

Suppose U is a power of 2, x 1 / 10 ^ U ^ 2x x / 2 and U/2 < p < f7. We will 
count separately the quadruples (S3, S4, 7/j,p) in each of three classes: 1) those 
with w ^ £(0.99); 2) those with w < £(0.99), s > £(0.8); 3) those with 
w < £(0.99) and s < £(0.8). 

First, suppose w > £(0.99). By Lemma 2.2, given (S3, S4), the number of 
p with pssini + 1 prime is 0(U (log logx) 2 / log 2 x) and for each p the number 
of w with the remaining three numbers in (4.13) all prime is 

x(loglogx) 3 x(loglogx) 3 



£/s 3 s 4 log 3 (x/([/s 3 s 4 )) ^s 3 s 4 (log x) 2 - 97 ' 
By (4.11), 0(s 3 s 4 ) < 1.01 log logx, so 



V — ^ Y lM«(l O gx) 1+1 - 011 °g 2 

z — ' S3S4 ^-^ m 

J2(m)<1.01 log logx 
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Therefore, the number of quadruples in the first class is 0(x(log x) -2 ' 2 ), since 
there are O(logx) possibilities for U. 

Secondly, suppose w < £7(0.99) and S3 ^ £7(0.8). Applying Lemma 2.2 
twice, for each pair (54, w) the number of possible (p, S3) with p and the num- 
bers in (4.13) all prime is 

_x_ (log logx) 4 a; (log logx) 4 

WS4 (log 3 x)log 2 (x/(f/s 4 ttj)) WS4(l0gx) 4 - 6 

Since TO3 is a totient, by Lemma 2.3 and partial summation we have 

E 1 E 1 «( 1 °g-) L01 - 

w S4 

Summing over U gives 0(x(log x) -2 ' 58 ) quadruples in the second class. 

Finally, suppose S3 < £7(0.8) and w < £7(0.99) . By Lemma 2.2, for each 
pair (53, w), the number of (p, S4) with p, ps^rai + 1 = 7*1, S3S4W + 1 = q and 
(S3S4W + l)pm + 1 all prime is 

x(loglogx) 4 x(loglogx) 4 
S3 w log 2 xlog 2 (x/([/s 3 u>)) s 3 wlog 4 x 

We have ignored the fact that r2 is prime, because is a quadratic function of 
S4. As in the previous case, wm^ being a totient implies that the sum of 1/w is 
<C (logx) 0,01 . Since the sum of I/S3 is 0((logx)°' 8 ), there are 0(x(log x)~ 2 ' 18 ) 
quadruples in the third class. The number of triples (mi, 7712,7713) is 0(1), so 
there are 0(x(log x)~ 2 - 18 ) pairs (p,q) counted in case (ii). □ 



5. Part II, the case (n,pq) = 1 

In this argument we use for the first time the fact that the prime factors 
of p— 1 and q — 1 are nicely distributed (i.e. the full strength of (4.11)). Recall 
that Ej = exp{(logx) 1 ^' 5 } for < j < N - 1 and E N = 2. 

We may assume that any prime divisor of n which is ^ £7jv-i divides n 
to the first power, for the number of (p,q) with r(r — 1) | m(p — l)(q — 1) for 
some r ^ £tv_i is 0(x/£tv_i). Denote by the set of remaining pairs (p,q). 

Let r±, . . . , rM be the prime factors of n satisfying 

P+(r*)>E N ^, r*:= n - 1 

(ri - 1,P9) 

Clearly 1 < M < C. Let w = cj)(n/ (7*1 ■ ■ ■ r^)), so that 
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In our new notation, we have 

(5.1) r{ ■ ■ -r* M w* = m(p- l)(q - 1). 

We will partition the solutions of (5.1) according to the value of M (the number 
of "large" prime factors of n), the size of the largest prime factor of each r*, 
and the location of the factors p and q among the factors r% — 1 and w. For 
convenience, set tq = w+1. Define numbers i' and i" hyp \ (ry — 1), q | (rj» — 1). 
It is possible that i' = i". Let Ij = [Ej,Ej_\) and define numbers ji by 
P + (r*) £ The primes may be ordered so that 1 = j\ < j2 ^ ■ ■ ■ ^ Jm- 
The fact that j\ = 1 follows from the j = case of (4.11). 

By (4.8), it follows that p — 1 ^ r*, and q — 1 ^ r*„. Thus we may define 

(5.2) ^ = P+ f (P- W A ^ = P+ ^ 



In other words, S" is the largest prime dividing p— 1 and r*, to different powers. 
Define by 5" G If, S" G /-,», so that j' and j" measure the rough sizes of 
<S" and S" . By (4.11), if j' > 1 then the prime factors of p— 1 which are ^ £i 
also divide r*,, so that jV = 1. Similarly, if j" > 1 then = 1- Furthermore, 
if either j' > 1 or j" > 1 then there are at least two values of i with ji = 1. 
Thus, without loss of generality, if j' > 1 we may assume %' = 1 and if j" > 1, 
we may assume %" = 2. 

We count separately the number of (p, g) 6 giving solutions of (5.1) 
with M, ji, . . . ,jM, i', i", j' and j" fixed. The number of choices for these 
parameters is O(l) since it only depends on m. 

To reap the full benefit of (4.11), we break each factor in (5.1) into N 
pieces as follows: 

N N N 

m(p-l) = l[ir j , q-l = l[9 j , r* = ft J (Ki<M), 
j=i j=i j=i 

where nj, 0j and Qij are composed only of primes in Ij. We can suppose x 
is large enough so that P + (m) < En-i, i.e. m \ tin- Some of these variables 
may equal 1. In particular, for each i we have Qij = 1 for j < ji. By the 
definition of j' and j" we also have 

(5-3) *j = Qij (j<f), ®i = Q%3 U<J")- 

Let 



(5.4) 



aj = \{i > (j = I,-- - ,N-1), 

b = 1 : (ri - l,pq) > 1}|, 

bj = 1 : = j, (ri - l,pq) = 1}|, (i = 1, . . . , TV - 1). 
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For each j, aj is the maximum number of Qij which can be > 1. The estimation 
of the number of solutions of (5.1) proceeds in TV steps; at step j we count the 
possibilities for TTj,0j and the Qij, assuming the variables consisting of smaller 
primes have been fixed. The possibility that f > 1 or j" > 1 will force us to 
express some bounds in terms of products involving the Legendre symbol, and 
these bounds must be carried over many steps. For convenience, for each j ^ 1 
set 

(5.5) />';., II - Pj = U ^ T j = l[ e h . 

h^j h^j h^j 

If f > 1 define 

(5.6) D 1 = R 1 j,(R ld ,-4P j ,/m), H ± = JJ 
and if j" > 1 define 

(5.7) D 2 = R 2 j„(R 2J „-4T j „), H 2 = [J (i-L(^X\. 
Note that by (5.3), 

(5. 8 ) (El^j = ^i(^i-4(p-l)) ^ ^ = ^ r* 2 (r* 2 -A(q-l)) ^ _ 

Below are the estimates that we will prove: 

Step 1. Let w* , P 2 , T 2 and Ri >2 , . . . ,Rm,2 be given. The number of 
possibilities for tti, 6\ and f?i,i, . . . , £>m,i is 

« _^-(loglogx) 3 + 6o+6l+2ai (logx)- 4 - 6o - 6l+ ^ 3 + f>0 + bl+2ai )x 1 X 2 , 

where 

_ / 1 J' = 1 y f 1 J" = 1 



1 3+f „ fl ./ /' ./ 



Next, define 

v ' t//, } ' / • Z] ~\h 2 j j" 

Step j, 2 < j < iV - 1. Let u/*, P J+ i, T i+ i and . . . ,Rmj+i be 

given. Then 

Y i Z i < (ioglogx) 2 ^ +14 ^(logx) 2,5(1+,5)log ^ +25+ ^ (j ^ 1)+4 ^ 5 , 

where the sum is over all possible TTj, 9j and g±j, . . . , £>mj, and (j = 1 if j = f 
or J = j" and (j = otherwise. 
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Step N. We have 

Y YE^L « (loglogx) 14 (loga;) 5 5+ 2 5 (l +5 )log(2M + 2) ) 

where the sum is over w* , 7Tjv, 9n, and Qi t N (1 ^ i ^ M). 

Using the fact that 60 + ■ ■ ■ + bjv-i = M and combining all A" steps, the 
total number of solutions of (5.1) is 

< x(loglogx) 2M+45+ai (loga;) y , 

where 

(N-l \ 

Y = -4 - M + 25 2 logo,- + log(2M + 2) + <DT, 

AT-l 

X = 16 + 6 + h + 2ai + ^ + 2 + 2 log Oj) + 2 log(2M + 2). 

i=2 

Since a,j ^ M for each j, the coefficient of J 2 is at most 2N log(2M + 2). Also, 
60 < 2; thus 

AT-l 

ia log(2M + 2) + 14 + 2N + 2M + ^ (2 log a, + j6A. 

j=i 

The number of aj = i is jj+i — jj. Also, ji = 1 and 2 log 2 > 1; thus, if M ^ 2 
then 

AT-l M Af-1 

J] (2 log 0j - + j6j) < Yl * + 2 Yl &+ 1 " *) log * + 2 ^ - 1 - log M 

jr' = l 1=1 i=l 

M 

= 2(N - 1) log M + n + (l - 2 log (^r)) 

i=2 

M 

< 2JV logM + N (l " 21 °g (A)) 

i=3 

= A^(M-2 + 21og2). 

The above bound also holds when M = 1, since ctj = 1 for all i and bj = for 
j > 1. Therefore, by (4.4), we obtain 

y s$ -4 + 2 log 2 + 5(14 + 2M + 4 log(2M + 2)) s$ -2.603 

for every possible M, i' , i", j' , j" and ji, . . . , j'm- In conclusion, the number 
of solutions of (4.2) with (n,pq) = 1 is 0(a;(logx)~ 2 - 6 ). Together with (4.12) 
and Lemma 4.4, this completes the proof of Part II of Theorem 2. 

We now establish the bounds claimed for each of the A" steps. 
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Proof of Step 1 bound. Let I = {i : ji = 1} = {1, 2, . . . , cti}. Recall (5.5) 
and for i G I write 

(5.9) g i:1 = AiBi, Ai = (&,i,7ri), = (0i,i,0i). 
Then 

(5.10) tti=JJ^, 0i = Y[Bi 

iei iei 

and 

(5.11) P~ (jlAiB^ > El 

We need to count the number of (Ai, . . . ,B ai ) satisfying (5.11) such that the 
following 3 + &o + °i numbers are prime: 

iriPi/m + 1 = p, 

B X T X + 1 = q, 

(5.12) { (vriPi/m + 1)(6>iTi + l)m + 1 = pqm + 1, 
AiBiRi^ri - l,pq) + 1 = n, (i G I), 

-Ri,i(r-i - l,pg) + 1 = 7-j, (i = i' / or i = i" I). 

Recall that p \ (r^ — 1) and q \ {r{n — 1). All variables Ai, Bi occur linearly on 
the left of the above expressions except in a few cases. If i' G / (i.e. jV = 1), 
then Ai> occurs quadratically in the expression for (fourth line in (5.12)), 
and if i" G / then By* occurs quadratically in the expression for rj» (fourth line 
of (5.12)). There are O(l) possibilities for h = {i : A { > 1}, I 2 = {i : Bi > 1}. 
Fix Ji, ^2 and note that both I\ and I 2 are nonempty and 7i U I2 = I- Let 
J=\h\ + \h\. 

Place each variable which is > 1 into a dyadic interval, e.g. U{ < Ai ^ 2J7j 
for z G Ji, Vi < Bi < 2Vi for % G ^2, where each Ui, Vi is a power of 2. For 
some ordering of the J variables, we use a sieve to bound the possibilities 
for the first variable with the other variables undetermined. Then with the 
first variable fixed, we use the sieve to bound the possibilities for the second 
variable, with the remaining J — 2 variables undetermined, and so on. In most 
cases the variables may be ordered so as to apply Lemma 2.2 with each. The 
exceptions are when either Ay appears quadratically in the expression for ry 
and is isolated (none of the other J — 1 variables appears in the expression for 
ty), or when Byi has a similar property, or when i' = i" = 1 and Ay and By 
are isolated in the expression for ry. We therefore distinguish three cases: (i) 
ai = i' = i" = 1; (ii) / = j" = 1 other than case (i); (iii) / + j" > 2. 
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In case (i), we have J = 2, j' = j" = 1, bo = 1, b± = 0, 7Ti = Ai and 
01 = B\. By Lemma 4.1, the number of pairs A\,B\ satisfying (5.11) such 
that p = Ai(Pi/m) + 1, q = B\T\ + 1, pqm + 1 and AiBiRi^pq + 1 = r\ are 
prime is 0(L r iVi(loglogx) 6 (log£ ; i)~ 6 ). 

In case (ii), we first choose the numbers (if i' > 0) and 5j» (if i" > 0). 
Since j' = j" = 1, none of the primes in (5.12) are determined solely by these 
choices, so no sieving is required. The remaining variables Ai and B, L occur 
linearly on the left of the expressions in (5.12), so we may apply Lemma 2.2 
repeatedly. The number of solutions counted is 

« (n * n v-Viogiogx) 3+b »+^+ j (iog J E 1 )-( 3+b » + ^ +j ). 

\eh ieh 

For case (iii), suppose j' > 1, %' = 1, so that it\ = Qi\ = A\ and r\ = 
A\Ri^(A\Pi / m + 1) + 1. By Lemma 2.1 and (5.8), the number of possible A\ 
is <C ?7ii7i(loglogx) 2 (logi?i)~ 2 . Similarly, if j" > 1 the number of possible 
B2 is <C C/2-ff2(loglogx) 2 (log£'i) _2 . Lemma 2.2 is applied repeatedly to the 
other variables yielding a total of 

< (n ^ n v^iiogiogxf+^+^iiogE^-^+^+^x^ 

\eh ieh 
solutions counted. 

In every case, with I\ and I2 fixed, 

Ui,Viieh ieh 
and the bound in Step 1 follows. 

Proof of bound for step j 6 [2, N — 1]. All j. There are bj numbers i with 
ji = j and (ri — l,pq) = 1. Recall (5.2) and let I denote the set of such i 
with the additional property that S' \ Qij and S" \ Qij. For each i G I, let 
Wi = P + (Qi : j). Fix the partition of these i into Ji, the set of % with W\ \ 6j, 
and I2, the set of i with Wj | irj. There are O(l) such partitions. Also fix 

r := {g* hj ,... ,q* mj ;it*,9*}. 

Here, if j / j' and j 7^ j", we take ■ = Qij/Wi ii i £ I and £>|j = £>jj 

otherwise, 7r* = ftj/Ylieh ^ anc ^ = ^j/Y\ieh ^ = we a ^ so divide 
out the factor 5' from either 7Tj or 0j, as well as the appropriate Qij. If j = j" 
we divide out the factor S" in the same manner. Let 

(5.13) t = Ql j ---Q* MJ = ir*e* 
and note that by (4.11), 

(5.14) fl(t) < 25(1 + <5) loglogx. 
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Our procedure will be to fix r and the Wi (i G I), and sum YjZj/S'S" over 
S', S" if necessary. Then for each i E I, Wi and WiQ*jRij+i + 1 are prime, so 
by Lemma 2.2 and partial summation, 

(5.15) E ^ « (log log x) 2 (log x)^- 1 . 



For fixed t, let h(t) be the number of (M + 2)-tuples r with product t. At 
most Oj of the numbers Qij are > 1, thus h(t) is at most the number of dual 
factorizations oft into 2 and a,j factors, respectively. Actually if j < max(j',j") 
we can do better, but this bound suffices. Therefore h(t) ^ (2aj) n ^ . Since 
2 G" Ij, by (5.14) we have 

(5.i6) — * a f (1+s) loslosx E — 

sc (loga;) 25 ( 1+5 ) los ^ Y[(l - 2/p)- 1 
peij 

All that remains is to take care of the sum on S', S". 

Proof of bound for step j G [2, N — 1]; j ^= j' , j / j". Here there is no 
sum of S' or S", so we are done by (5.15) and (5.16). 

Proof of bound in step j G [2, N — 1]; j = j' ^ j" . Since / > 0, we have 
i' = 1. There are three cases to consider: (i) S' \ Qij, S' \ ttj; (ii) S' \ iTj, 
S' \ Qij, S' | Qij with ji < j; (iii) S' \ ttj, S' \ Qi :j , S' \ Q iyj with ji = j; for 
all cases suppose S' G (U, 2U] where U is a power of 2 in Ij. In case (i), (5.6) 
gives 

Di = AS'(AS' - 4J>v/m), 

with A = Ri j'/S' and Pj'/m fixed. Therefore, by (4.5), (4.9) and Lemma 4.2 
(with k = 1, c = 1, Q = Qj'), 



8 



Fl log Qj/ (log log z) 4 5 

1/ U<S%2U (7 10g U 

In case (ii), (5.6) gives 

D 1 =R ltjl (R 1 j l -S , B), 

with .£> = 4Pj/(mS') and fixed. As in case (i), by Lemma 4.2 (with k = 1, 
c = 1, Q = Qj')) we have 

Yl Yl fr < (log log x) 5 (log x) 5 . 

C7 U<S%2U 
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For case (iii), we also have 

D 1 = R 1:j ,(R 1J , -S'B), 

with B = APj/{mS') and R\ j> fixed. Here we must sum over S' for which 
S'R + 1 is prime, where R = Rij/S' is fixed. By (4.5) and Lemma 4.2 (with 
k = 2, c = 1), we obtain 

E E g « E ' og0 ',- ( '?f'°^ )4 « (logbg.i'dog.)'-^. 

The desired bounds now follow from (5.15) and (5.16). 

A symmetric argument using (5.7) proves the step j bound when j = 
J" / f- 

Proof of bound in step j; j = j' = j"; S' / S" . By (5.3), = 2. Define 
i*, i** by S' | Qi*j, S" \ Qi ** d . Put S' G (U',2U'], S" e ([/",2c/"] where U' 
and C/" are powers of 2 in [jj-Ej, Ej-i)- By the Cauchy-Schwarz inequality, 

^ i ( X /2 ( xl/2 

U',U"S',S" U',U" \S',S" J \S',S" 

For each double sum over S',S", we first fix 5" and sum over S", then sum 
on S" , bounding each sum with Lemma 4.2, Lemma 2.2 or the prime number 
theorem, as appropriate. For example, for the sum on 5" in the first sum we 
use Lemma 4.2 (with c = 2) if D\ depends on S' . If i* > 3 then S'R' + 1 is also 
prime, where R' = Ri*j/S' is fixed, so we use the k = 2 case of the lemma. 
Otherwise we use the k = 1 case. If D\ does not depend on S', we use Lemma 
2.2 if i* ^ 3 and the prime number theorem otherwise. For the sum on S", 
we use Lemma 4.2 only if D\ depends on S" but not on S' . If i** ^ 3 and 
i** 7= i*, then we use the k = 2 case of Lemma 4.2, since S"R" + 1 is prime, 
where R" = Ri**j/S" is fixed. Otherwise use the k = 1 case. If D\ does not 
depend on S", or it depends on both S' and S", we apply either Lemma 2.2, 
if i** ^ 3 and i** 7^ i*, or the prime number theorem, otherwise. In every case 
we obtain the bound 

^ 1 (logC/0 fe ^ 3 (logC/")^+3( lo g lo g^ > 

where fci = 1 if i** > 3 and i** 7^ i*, k± = otherwise, and k 2 = 1 if i* ^ 3 and 
^2 = otherwise. The same bound holds for the sum on H%, so 



/ < Z-^ S'S" 

U',U" S',S" 



< (log log x) 14 (log x ) 4 < 5 -( fe i+fe)(i-i5) . 



The desired bound again follows at once from (5.15) and (5.16). 
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Proof of bound for step j G [2, N — 1]; j = f = j" , S' = S" . Since 
{iTj,0j) = (Qi,j,Q2,j) = 1, either S' \ TTj,S' \ Q2j or S' \ 0j,S' \ gij. In either 
case, by (5.6) and (5.7), D\ and D2 both depend on S' (recall r and the W, 
for i £ I are fixed). As before we put S' G (£/, 2E7] where £/ is a power of 2 in 
[^Ej-i, Ej). By the Cauchy-Schwarz inequality and Lemma 4.2 (with c = 2, 
k = 1) we obtain 

E E ^<E^(E^) 1/2 (E^)" 2 

1/ U<S%2U U \ S' / \ S' / 

« (loglogx) 14 (logx) 4<5 . 

Combined with (5.15) and (5.16) gives the desired bound. This concludes the 
proof of the bounds for steps j = 2 through j = N — 1. 

Proof of the bound for step N. Let 

r = \VTjy , t) N ; g 1N , ■■■ , Q M ,N,w ), 

where the starred variables (except w*) are the quotients of the corresponding 
unstarred variables and S', S", as appropriate. As before let t = tt* n 9* n . We 
proceed as in steps 2 through N—l, except now there are no variables Wi to sum 
over. The sum over S' and S" is handled exactly as in steps 2 through N—l, but 
the sum over r of 1/t is handled a bit differently. Since 2 £ In, the procedure 
in (5.16) will not work. By (4.11), we have Q(t) ^ 25(1 + 5) log log x + f2(m). 
As in the other steps, given t, the number of dual factorizations of t into M+l 
factors and 2 factors is at most (2M + 2) n ^ . We thus have 

- < (2M + 2) 25 ( 1+5 ) lo s lo s* - 

«(logx) 5+2 ^) lo s( 2M + 2 ), 
and the bound for step N follows. 
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